EPPS 6356 Data Visualization Project
This storyboard delivers our final project product by visualizing Formula 1 Racing data focused on analyzing information on the drivers and different circuits.
Different drivers’ race win records across various circuits are presented here. The legend on the bottom lists the circuits in the data set and then the bar chart visualizes the number of wins for each driver.
Multiple linear regression was utilized to determine which factors
are important to evaluate the best driver.
Wins = b0 + (Pole Wins) X1 + (Total Points) X2 + (Fastest Laps) X3 +
(Podiums) X4 + e
Note: b0 is the intercept of the regression line and e is the model
error (residuals) or the variation in the model
R^2 = 0.9884, p-value = 8.821e-08
All factors were significant except Fastest Laps. Tried to evaluate the
height factor, however, the p-value was truly not significant since the
p-value was 0.895225.
The residual values are not completely normally distributed. This
histogram is skewed a bit at the ends. In the normal Q-Q plot, the
normality appears to be more clear because the values follow a straight
line.
Fernando Alonso has the most laps/distance covered in F1 Racing and so this visual demonstrates his record breaking results.
(graph 4 description here)
(graph 5 description here)
(graph 6 description here)
(graph 7 description here)
(graph 8 description here)
(graph 9 description here)
Line Chart to represent Total Points in 2021 for 6 different racers
(graph 11 description here)
(graph 12 description here)